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We have performed a search for the lepton-flavor- violating decay r — > /ry using a data sample of 
86.3 fb _1 accumulated by the Belle detector at KEK. No evidence for a signal is seen, and we set 
an upper limit for the branching fraction of B(r — > fij) < 3.1 x 10 -7 at the 90% confidence level. 

PACS numbers: 13.35.Dx, 11.30.Fs, 14.60.Fg 



The decay r —> fi^y violates lepton flavor conservation 
and is forbidden within the Standard Model (SM). How- 
ever, some supersymmetric models, left-right symmetric 
models, and others predict a branching fraction in the 
range 10 _T to 10 -9 , which is accessible at an e + e~ IB- 
factory. It is notable that B(r — > /ij) could be enhanced 
by a factor of 10 5 -10 6 relative to B{ji — > ej) : because rel- 
evant kinematical factors depend on powers of m r /ra M . 
The decay r — > fij is thus a promising process in which 
to search for new physics. 

This decay has previously been searched for by MARK- 
II 0, ARGUS 3], DELPHI 0, CLEO 00, and BaBar 
0. The most sensitive upper limit, reported by CLEO, 
is B(r /17) < 1.1 x 10- 6 at 90% C.L. 

We present here our study using 86.3 fb _1 of data col- 
lected at the T(45) resonance with the Belle detector at 
the KEKB asymmetric e + e~ collider^. A description 
of the detector can be found in Ref. 

We search for an event composed of exactly two 
oppositely-charged tracks and at least one photon can- 
didate, which is consistent with a r + r~ event in which 
one r decays to /17 and the other r decays to a non-muon 
charged particle, neutrino(s) and any number of 7's. 

The selection criteria are determined by examining 
Monte Carlo (MC) simulations for signal r-pair decay 
and background events (BG), such as generic r-pair de- 
cay (r + r~), qq continuum, BB, Bhabha, and p + fi~ as 
well as two-photon processes. The KORALB/TAUOL A 
flo| and QQ H l[ g enerators are used for event generation, 
and GEANT3 |l2[ is used to simulate the Belle detector. 
The two-body decay r — > /ij is initially assumed to have 
a uniform angular distribution in the r's rest system; 
possible deviations from this are considered later. 

The selection criteria are chosen to maximize the signal 
sensitivity. Kinematic variables with a CM superscript 



are calculated in the center-of-mass frame; all other vari- 
ables are calculated in the laboratory frame. Each track 
is required to have momentum p CM < 4.5 GeV/c and 
momentum transverse to the e + beam p t > 0.1 GeV/c. 
Both tracks are required to be within the acceptance of 
a muon identification system (KLM): —0.819 < cos# < 
0.906, where 6 is the polar angle with respect to the e + 
beam direction. Muon candidates are identified via a 
muon relative likelihood pjj, which is based on the 
difference between the range calculated from the parti- 
cle momentum and the range measured by the KLM. 
It also includes the y 2 formed from the KLM hit loca- 
tions with respect to the extrapolated track. The charged 
track that forms a r — > fij candidate is required to have 
p > 1.0 GeV/c and £ M > 0.95. The other track (on the 
"tag-side") is required to have £ M < 0.80; i.e., we require 
that it not be a muon to reduce e + e~ — > /i + /i~7 back- 
ground (7 from initial-state radiation). The fraction of 
muons with £ M < 0.80 is denoted as 77 and is measured 
to be (for a tag-side muon) (12 ± 3)%. 

Photon candidates, whose definition is given in 
Ref. 14], are required to be within the acceptance of an 
electromagnetic calorimeter (ECL): —0.866 < cos# 7 < 
0.906. The photon that forms a r — > /ij candidate is re- 
quired to have an energy E 1 > 0.5 GeV in order to avoid 
a spurious combination of a low-energy 7 with the muon. 

A cut E^J^ < 9.0 GeV is imposed to reject Bhabha 
scattering and production, where E^J^ is defined 

as the sum of the energies of the two charged tracks and 
the photon composing the fij. A restriction on the open- 
ing angle between the \i and 7, 0.4 < cos 6^:^ < 0.8, 
is particularly powerful to reject background events aris- 
ing mostly from e + e~ — > r + r~, r — » tt°X. This pro- 
cess forms the backward peak in the open histogram in 
Fig. [Ha). The opening angle between the two tracks is 
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FIG. 1: Distributions of the opening angle between (a) the fi 
and 7 on the signal side, and (b) the tagging track and the 
missing momentum. MC distributions for signal and t + t~ 
events are indicated by shaded and open histograms, respec- 
tively, and data by closed circles. The arrows show the se- 
lected ranges. 



required to be greater than 90°. 

We define Pmiss as the residual momentum vector cal- 
culated by subtracting the vector sum of all visible mo- 
menta (from tracks and photon candidates) from the sum 
of the e + and e~ beam momenta. We include all photon 
candidates with energy greater than 0.1 GeV in the p m iss 
calculation. Constraints on the momentum and polar 
angle of the missing particle(s), p m iss > 0.4 GeV /c and 
-0.866 < cos m i ss <C 0.956, respectively, are imposed 
to increase the probability that the missing particle (s) 
is an undetected neutrino(s) rather than 7's or charged 
particles falling outside the acceptance of the detector. 
To remove r + r~ events, we apply a requirement to the 
opening angle between the tagging track and the missing 
particle, cos0^J^_ miss > 0.4, as shown in Fig. E^b). 

Next, a condition is imposed on the relation between 
Pmiss and the mass-squared of a missing particle (ra^ ss ). 
The latter is defined as E^ niss — p miss1 where E m i SS is 
11.5 GeV (the sum of the beam energies) minus the 
sum of all visible energy and is calculated assuming the 
muon (pion) mass for the charged track on the signal 
(tag) side. We require Pmiss > -5 x m^ niss - 1 and 
Pmiss > 1.5 x m^ iss - 1, where p miss is in GeV/c and 
rrimiss is in GeV/c 2 . This cut loses 24% of the signal 
but removes 98% of the remaining r + r~ background (see 
Fig. El and 86% of the background. 

After these selection requirements, 713 events remain 
in the data without any restriction on the mass and mo- 
mentum of the fij system. The detection efficiency is 
evaluated by MC to be e = 12.0 ± 0.1%. 

The candidate fij system should have an invariant 
mass (Mi nv ) close to the r lepton mass and an energy 
close to the beam energy in the CM frame; i.e., AE = 
— E^^ rn ~ 0. When deciding on our selection crite- 



ria, we excluded the signal region 1.70 GeV/c 2 < M inv < 
1.85 GeV/c 2 so as not to bias our choice of criteria (a 
"blind" analysis). Only after all cuts were finalized and 
the number of background events estimated did we in- 



FIG. 2: Distribution of events in the ra^ iss -p m i ss plane. The 
selection boundary is indicated by two lines for (a) signal MC 
events and (b) t + t~ MC events. 
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FIG. 3: Remaining events in data (circles) and the expected 
density for signal MC (shaded) in the AE-Mi nv plane. The 
region between the dashed lines is kept excluded until the 
selection criteria are finalized and the expected background 
estimated. The signal box (defined in the text) is indicated 
by a dash-dotted box. 



elude this region and count the number of signal events. 
The resultant AE vs. Mi nv plot is shown in Fig. |3l For 
comparison, the equivalent plot for MC r — > fij decays 
is also shown. Because of the photon's energy leakage 
from the ECL detector and initial-state radiation, the 
MC simulation exhibits a long low-energy tail across the 
AE-Mi nv plane. The individual AE and Mi nv distri- 
butions around the peak are reproduced by asymmet- 



ric Gaussians with a 



MeV and a 



low /high 

M inv 



AE 



= (75.4 ±0.7)/(33.7± 0.4) 
(23.1 ± 1.2)/(15.0 ± 0.6) MeV/c 2 , 
where (j low / hl 9h me ans the standard deviation at the 
lower/higher side of the peak. The peak positions are 
-1.1 ± 0.5 MeV and 1776.8±1.0 MeV/c 2 for AE and 
M inv , respectively. 

We define a AE-Mi nv region to evaluate the number of 
signal candidates. The signal box is denned as the area 
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within ±3<r for both AE and M inv : -0.23 GeV < AE < 
0.10 GeV and 1.71 GeV/c 2 < M inv < 1.82 GeV/c 2 . The 
acceptance (Q) for signal events passing all previous cuts 
is 87.3%. 

There are two dominant sources of background: 
e + e~ — > /i + /i~7 and e + e~ — > t + t~7, in which the 
photon is radiated from the initial state. In the former 
case the muon on the tag side is misidentified; in the 
latter case the muon on the signal side originates from 
r — » \±vV decay. We hereafter denote the former back- 
ground as /Ji/ij and the latter as rrj. The rrj back- 
ground is studied using a 150 fb _1 sample of MC r + r~ 
events. The ^7 background is studied using data by 
requiring that both tracks be muons and applying the 
muon inefficiency (77) to the tag-side track. For this se- 
lection, the signal (tag)-side muon is required to have 
£ M > 0.95(0.80). Within the whole AE-M inv region 
shown in Fig. 03 the rrj process yields 90.6±7.2 events 
and the /Jtfij process 43.4±12.4 events. Among the other 
background processes mentioned above, only the con- 
tinuum background yields a nonnegligible contribution 
(12.7+5.7 events). This background has a rather large 
uncertainty as it is evaluated using an MC sample cor- 
responding to only 34 fb _1 . The expected backgrounds 
amount to 143.7+15.5 events in total, where 3.0 events in 
the iJt/jLj sample that are estimated to originate from rrj 
are subtracted to avoid double-counting. The number of 
data events in this region (now including the previously- 
excluded region) after all selection cuts is 160+13. This 
yield is consistent with the background estimate. 

The distributions of rrj and /Jt/ij background events, 
N TT1 (M inv ,AE) and N^(M inv ,AE), exhibit different 
behavior in AE. The former populates only the negative 
AE region, while the latter is distributed mostly in the 
positive AE region. Both types of background events, 
however, exhibit a similar correlation in the AE-Mi nv 
plane and are empirically reproduced by a combination 
of Landau and Gaussian functions in AE, and a linear 
function in M inv . The background distribution can then 
be represented by the sum of two BG components as 



N BG (M inv ,AE) 



N T ^(M invj AE) x (1 + A) 
N^(M inv ,AE)xK, (1) 



where A is the continuum background contribution. This 
distribution is taken to be similar to that of TT7, as in- 
dicated by Monte Carlo simulation. The factors (1 + A) 
and k = 77/(1-77) are 1.14 ±0.09 and 0.14 ±0.04, respec- 
tively, where the former is obtained from MC and the 
latter is obtained by measuring the muon inefficiency (77) 
for e + e~ — > events. 

The above background parametrization is compared 
with the data outside of the excluded region. Fit- 
ting the background shape to the data gives 1 ± A = 
1.22 ± 0.13 and k = 0.11 ± 0.03, which agree well with 
the values given above. For both sets of 1 ± A and 
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FIG. 4: AE distributions in the signal region, 1.71 GeV/c 2 < 
Mi nv < 1-82 GeV/c 2 . The expected background is indicated 
by the solid (dashed) curve for 1 + A = 1.22(1.14) and k — 
0.11(0.14). The histogram is an average of both sideband 
spectra. The data are the points with error bars. The vertical 
dashed lines indicate the AE range of the signal box. 



n values, Fig. 0] shows the expected background events 
N B G(M inv ,AE) as a function of AE for 1.71 GeV/c 2 
< Mi nv < 1.82 GeV/c 2 . No appreciable difference is 
seen between the two NBG(Mi nv , AE) distributions, and 
we take the latter values for 1 + A and k. As the back- 
ground has a linear (and in fact small) dependence on 
M invi the amount of background in the signal region 
can be estimated by averaging the number of events 
counted in the Mi nv sidebands 1.553 — 1.663 GeV/c 2 and 
1.850 — 1.960 GeV/c 2 . These sidebands begin a distance 
^M inv from the edges of the signal region so that no po- 
tential signal biases this estimate. When the events in 
the signal box are included, the data event distribution 
is found to match the expected NBG(Mi nv , AE) well (see 
Fig.QJ. Within the signal box, 19 events are found in the 
data while 20.2 + 2.1 events are expected from eq.Q and 
20.5 ± 6.4 events from the average of M inv sidebands. 

An upper limit is obtained using the frequentist 
method described in Ref. 0, The likelihood func- 

tion is defined as 



-(s+6) 

Nl 



N 



HiaSi + bBi), 



(2) 



where N is the number of observed events, and s and b 
are the number of signal events and background events, 
respectively. Si and Bi are the signal and background 
probability density functions, where i indicates the i-th 
event. Si is obtained by generating 100 x 10 6 MC signal 
events, and Bi is taken from eq.Q. We apply this fit for 
s and b to a ±5<r region in Ai? and Mi nv , which has an 
acceptance (Q) of 93.1%. There are a total of 54 events 
in this region, and, when s is constrained to be >0, the fit 
finds s = 0.0 and b = 54.0. The y 2 of the fit projection in 
AE is 7.90 for 10 bins, while in Mi nv it is 5.57 for 10 bins. 
These values correspond to confidence levels (evaluated 
via toy MC) of 0.66 and 0.86, respectively. 

To calculate the upper limit, Monte Carlo samples are 
generated by fixing the expected number of background 



5 



events (b) to the value b = 54. For every assumed ex- 
pected number of signal events (s), 10,000 samples are 
generated, for each of which the numbers of signal events 
and background events are determined by Poisson statis- 
tics with means of s and b, respectively. We then assign 
AE and Mi nv values to these events according to their 
density distributions. An unbinned maximum likelihood 
fit is performed for every sample to extract the number 
of signal events (s MC ). The confidence level for an as- 
sumed s is defined as the fraction of the samples whose 
s MC exceeds s. This procedure is repeated until we find 
the value of s (sgo) that gives a 90% chance of s MC being 
larger than s. 

The resulting upper limit at 90% C.L. is §90 = 5.1 
events. An upper limit on the branching fraction is ob- 
tained via the formula: 



B{t -> /i 7 ) < 



S90 



2(eQ)N TT ' 



(3) 



where N TT is the total number of r-pairs produced. In- 
serting the values s 90 = 5.1, e = 12.0%, Vt = 93.1% and 
N TT = 7.85 x 10 7 gives B(r fij) < 2.9 x 10- 7 [f]. 

To take into account systematic uncertainties related 
to sgo, (1 + A) and k are varied by ±la each. This af- 
fects sgo by +0.06/— 0.11 events. The uncertainties in 
the functional forms of the background events are eval- 
uated by varying the most sensitive parameters by their 
evaluated errors. The functional form is broadened or 
shortened by 1.2 or 0.8 times for N TT ^(M inv , AE) and 
by 1.4 or 0.9 times for N^(M inv ,AE), and their cen- 
ters are shifted by ±0.02 GeV for N rr ^(M inv , AE) and 
by ±0.015 GeV for N^(M inv ,AE). These factors are 
about the largest that still give an acceptable fit to the 
background distributions. The shift of the central value 
for the N TT1 (M invi AE) spectrum yields the largest ef- 
fect of ±0.2 events, and the overall systematic uncer- 
tainty is evaluated as ±0.3 events. The stability of the 
result as the fit region is varied is checked by extending 
the AE-Mi nv region to ±8a; s = 0.0 and b = 105.0 are 
obtained, and §90 varies by only 0.07 events. 

The systematic uncertainties on the detection sensi- 
tivity 2(eQ)N rr arise from the track reconstruction ef- 
ficiency (2.0%), photon reconstruction efficiency (2.8%), 
selection criteria (2.2%), luminosity (1.4%), trigger effi- 
ciency (1.6%), and MC statistics (0.8%). The total un- 
certainty is obtained by adding all of these components 
in quadrature; the result is 4.7%. This uncertainty is in- 
cluded in the upper limit on B(r ^ fij) following p[fl. 



[f] Using the Bayesian approach described by the PDG [HI, we 
obtain a 90% C.L. upper limit of 7.9 for the number of r — ► fij 
events. Inserting this value into eq. |3J along with e = 12.0%, 
Q = 87.3% gives B(r ^7) < 4.8 x 1CT 7 . 

[X] The quantity l/(2(eQ)N TT ) is integrated assuming a Gaussian 
distribution for 2(eQ)N TT . 



The angular distribution of the r — > fi^ decay essen- 
tially depends on the LFV interaction structure [ljj , and 
spin correlations between the r's at the signal and tagged 
sides must be considered. To evaluate the maximum pos- 
sible variation, V — A and V+A interactions are assumed; 
no statistically significant difference in the AE-Mi nv dis- 
tribution or in the efficiency is found compared to the 
case of the uniform distribution. 

The incorporation of all systematic uncertainties in- 
creases the upper limit by 6.3%, of which the effect of 
the NBG(Mi nv , AE) uncertainty dominates. As a result, 
the upper limit on the branching fraction is 



B(t fij) < 3.1 x 10" 7 at 90% C.L. 



(4) 



This result is lower than previous limits for this mode 
and helps constrain physics beyond the Standard Model. 
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